Single channel speech separation using maximum a posteriori estimation

نویسندگان

  • Mohammad H. Radfar
  • Richard M. Dansereau
چکیده

We present a new approach for separating two speech signals when only a single recording of their additive mixture is available. In this approach, log spectra of the sources are estimated using maximum a posteriori estimation given the mixture’s log spectrum and the probability density functions of the sources. It is shown that the estimation leads to a two-state, non-linear filter whose states are controlled by the means of the sources. The first state of the filter is expressed using a combination of two Wiener filters whose parameters are controlled by the means and variances of the sources and noise variance and the second state is expressed by the means of the sources. Through the experiments, conducted on a wide variety of mixtures, we show that the MAP based estimator outperforms the methods which use binary mask filtering or Wiener filtering for the separation task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Noise Reduction by Maximum a Posteriori Spectral Amplitude Estimation with Supergaussian Speech Modeling

ESTIMATION WITH SUPERGAUSSIAN SPEECH MODELING Thomas Lotter and Peter Vary Institute of Communication Systems and Data Processing ( ) Aachen University (RWTH), Templergraben 55, D-52056 Aachen, Germany E-mail: lotter vary @ind.rwth-aachen.de ABSTRACT This contribution presents a spectral amplitude estimator for acoustical background noise suppression based on maximum a posteriori estimation and...

متن کامل

Single Channel Audio Source Separation

-Blind source separation is an advanced statistical tool that has found widespread use in many signal processing applications. However, the crux topic based on one channel audio source separation has not fully developed to enable its way to laboratory implementation. The main idea approach to single channel blind source separation is based on exploiting the inherent time structure of sources kn...

متن کامل

A Generalized Approach for Model-based Speaker-dependent Single Channel Speech Separation

Abstract– In this paper, we present a new technique for separating two speech signals received from one microphone or one communication channel. In this special case, the separation problem is too ill-conditioned to be handled with common blind source separation techniques. The proposed technique is a generalized approach to model-based speaker-dependent single channel speech separation techniq...

متن کامل

Dynamic channel compensation based on maximum a posteriori estimation

The degradation of speech recognition performance in real-life environments and through transmission channels is a main embarrassment for many speech-based applications around the world, especially when non-stationary noise and changing channel exist. In this paper, we extend our previous works on Maximum-Likelihood (ML) dynamic channel compensation by introducing a phone-conditioned prior stat...

متن کامل

Title of Document : MAXIMUM LIKELIHOOD PITCH ESTIMATION USING SINUSOIDAL MODELING

Title of Document: MAXIMUM LIKELIHOOD PITCH ESTIMATION USING SINUSOIDAL MODELING Vijay Mahadevan, Master of Science, 2010 Directed By: Dr. Carol Y. Espy-Wilson Department of Electrical and Computer Engineering The aim of the work presented in this thesis is to automatically extract the fundamental frequency of a periodic signal from noisy observations, a task commonly referred to as pitch estim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007